to entry (0, 1). In a similar way, the edge 
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is added to entry (1,2), by virtue of rule 3 and "loves" and the edge 
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is added to entry (2,3) by virtue of the rule 2 and "fish". 

Several items are added to S2, two of which are of more interest. On the basis of item 7 
and the item corresponding to rule 1 in (0,0), the edge 
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is added to (0, 1). On the basis of item 8 in (1, 2) and item 9 in (2, 3), the following edge is 
added to (1,3): 
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This complete edge can now be used with edge 10 to form, in S3, the following edge in (0, 3): 
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and since the head of this complete edge, which spans the entire input string, is more specific 
than the initial symbol, the string "John loves fish" is accepted by the parser. 



bears the number of the rule that licenses it. The string is a sentence of the grammar since the 
derivation starts with a feature structure that is more specific than the initial symbol and ends 
with feature structures that are subsumed by the lexical entries of the input string. 
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Figure 3: A leftmost derivation 

Finally, we simulate the process of parsing with the example grammar and the input "John 
loves fish". As a matter of convention, if [i, (A, k),j] is an item, we say that the edge (A, k) is 
in the entry. We also explicitly indicate the position of the dot (denoted by '•') within 

MRSs instead of using an integer. 

The first state, So, consists of l\ ex U Ipredict', hex contains three items: the pre-terminals 
corresponding to "John", "loves" and "fish" with the dot set to in entries (0,1), (1,2) and 
(2, 3), respectively. I pre dict contains, for each entry (i, i), where < i < 3, an edge of the form 
(R, 0), where R is one of the grammar rules. Thus there are 12 items in I pre dict- 

Si contains three more items. Application of DM to the item corresponding to rule 2 in 
entry (0, 0) and to the item corresponding to "John" in (0, 1) results in the addition of the edge 
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The grammar listed in figure 2 consists of four rules and three lexical entries. The rules are 
extensions of the common context-free rules S —> NP VP, NP — ► PN and VP —> V NP. Notice 
that the head of each rule is on the right hand side of the '=>•' sign. Note also that values are 
shared among the body and the head of each rule, thus enabling percolation of information 
during derivation. 
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Figure 2: An example grammar 



A leftmost derivation of the string "John loves fish" is given in figure 3, where each derivation 
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Figure 1: A graph- and AVM- representation of a MRS 



Definition 4.23 (Off-line parsability) A grammar G is off-line parsable if there exists a 
function f from AMRSs to AMRSs such that: 

• for every AMRS A, f(A) C A 

• the range of f is finite 

• for every string w and AMRSs A,B, if there exist kA,ks, that A 1J:a —> PT w (i,j) and 
B i..k B ^ PT w (i,j) and A%B and B%A then f(A) ^ f(B). 

Theorem 4.24 If G is off-line parsable then every computation terminates. 

Proof: (sketch) Select some computation triggered by some input w of length n. We want to 
show that only a finite number of items can be generated during the computation. Observe that 
the indices that determine the span of the item are limited: < i < j < n. The location of the 
dot within each AMRS A is also limited: < k < len(A). It remains to show that only a finite 
number of edges are generated. Suppose that [i, (A,k),j] £ S is an item that was generated 
during the computation. Now suppose another item is generated where only the AMRS is 
different: [i, (B,k),j]. If B □ A it will not be included in A(S) because of the subsumption 
test. There is only a finite number of AMRSs A' such that B C A (since subsumption is a 
well-founded relation). Now suppose A ^ B and B ^ A. By the parsing invariant (a) there 
exist A',B' such that A 1Jc A PT w (i,j) and B 1 k A PT w (i,j). Since G is off-line parsable, 
f(A) ^ f(B). Since the range of / is finite, there are only finitely many edges with equal span 
that are pairwise incomparable. 

Since only a finite number of items can be generated, and the states of the computation 
are such that S m C S m +\ for m > 0, a fix-point is reached within a finite number of state 
transitions. 
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Proof:By induction on m. Base: for all items z £ Ipredict, iz = jz an d the proposition obtains 
(vacuously). For all items z £ I\ ex , I = 1 and A z = PT(i z + l,j z ). 

If m > 0: Let a; = DM(x, y) where a; = [i^, (Ac, k x ),j x ],y = [i y , (A y ,ky},j y ] and a;, t/ £ Items. 
(Ay, fey) is complete, hence by 4.16 i y < j y and by the induction hypothesis and the completion 
theorem, A y y PT(i y + l,j y )- Also, len(A z ) > 1 since all rules are of length > 1. By 
the induction hypothesis, A^ kx PT(i x + l,j x ). Therefore A\" kx PT(i x + l,j x ). Since 
A k y y A PT(i y + l,j y ), we get A k * +1 A PT(i y + l,j y ). Therefore A\~ h - ^ PT(i z + l,j z ). 

Corollary 4.19 If a computation triggered by w = w\ , . . . , w n is successful then w £ 1(G). 

Proof: For a computation to be successful there must be a state S m that contains some item 
[0, {A, k — l),n] where k = len(A) and Abs(A s ) C A k . From the above theorem it follows that 
A i..k-i PT(l,n). Since A is complete, by the completion theorem A k PT(1, n), and thus 
wi, . . . , w n £ L{G). 

4.3.2 Completeness 

The following lemma shows that one derivation step, licensed by a rule R of length r + 1, 
corresponds to r applications of the DM function, starting with an item that predicts R and 
advancing the dot r times, until a complete item is generated. 

Lemma 4.20 If A —>■ B and there exists m > such that for every 1 < b < len(B) there exists 
xi £ S m such that xi = [ij, (Bj,, fcj), jj] is complete and B kh+1 = B b , and i\ = 0, = ij, for 
< b < len(B) and i/ en ^^ = n , then there exists m! > such that for every 1 < a < len(A) 

there exists y a £ S m i such that y a = [i a , (A a , k a ), j a ] is complete and A k a A+1 = A a and i\ = 
0,j a _i = i a forO < a< len(B) and 3i en (B) = n ' 

Proof: (sketch) A B by some rule R of length r + 1 that expands the p-th element of A to the 
elements q\, . . . , q r in B. For all elements of A except p, the proposition holds by assumption. 
Since R is a rule, there exists an item xr £ I pre dict that xr = [i qi , (R, 0), i qi ] where i qi is the 
first index of x qi . Let y\ = DM(xr, x qi ) and y\ = DM(yi-\,x qi ) for 1 < / < r. All the t/ 
items exist: by the requirements on the indices of the ajj-s, the indices of the y items fit. The 
unifications performed by DM don't fail: if they would, A wouldn't derive B. Then y r £ S m + r 
is complete (as there were exactly r applications of DM) and y r r +1 = A p . 

Theorem 4.21 (Parsing invariant (b)) If A PT(i+l, j) fori < j then there exist m > 
and x £ S m such that x = [i, (B,k — 1) , j] where k = len(B) and B k = A. 

Proof:By induction on d, the number of derivation steps. Base: if d = then A — ► PT(i+ 1, j) 
iff _PT(i + 1, j) £ 7; e:c , in which case an item as required exists. If d > 1, an immediate 
application of the above lemma to the induction hypothesis gives the required result. 

Corollary 4.22 If w = w\, . . . , w n £ 1(G) then the computation triggered by w is successful. 

Proof: Since w\, . . . , w n £ 1(G), there exists an AFS A □ Abs(A s ) such that A -> PT(l, n). 
By the parsing invariant, there exist m > and a; £ 5™, a; = [0, (B, k — l),n] where k = len(B) 
and A = Abs(A s ) Q A = B k and therefore the computation is successful. 

4.3.3 Termination 

It is well-known (see, e.g. ,[9, 11]) that unification-based grammar formalisms are Turing-equival- 
ent, and therefore decidability cannot be guaranteed in the general case. This is also true for 
the formalism we describe here. However, for grammars that satisfy a certain restriction, 
termination of the computation can be proven. The following definition is an adaptation of the 
one given in [11]. 



Definition 4.10 (Dot movement) The partial function DM : Items x Items — ► Items is 
defined as follows: DM([i, (A, 1«a), Ia], [Ib, (B, ks), j]) = [i, (C,kc), j], where: 

• l A = l B , n = len(A), m = len(B) 

• Ica < n — 1 (^Ae erfge (A, k A ) is active), ks = m — 1 (^Ae erfge (B,ks) is complete), 
kc = k A + 1 

• C* = (A, + 1) U 5 m fC «s obtained from A by unifying the element of A succeeding the 
dot with the head of B) 

DM is not defined if I a ^ Ib, if the edge in its first argument is complete, if the edge in its 
second argument is active, or if the unification fails. 

Lemma 4.11 If x = [i x , (A x , k x ),j x ] £ Items, y = [i y , (A y , k y ),j y ] £ Items and z = DM(x, y) 
is defined, then z = [i z , (A z , k z ) , j z ] where i z = i x ,j z > i z ,A z y A x and k z > k x . 

Corollary 4.12 Ifx,y£ Items then DM(x,y) £ Items if it is defined. 

To compute the next state, new items are added if they result by applying DM to exist- 
ing items, unless the result is more specific than existing items. This is a realization of the 
subsumption check suggested in [11, 12]. 

Definition 4.13 (Ordering items) If x = [i x , (A x , k x ), j x ] and y = [i y , (A y ,k y ),j y ] are items, 
x subsumes y (written x < y) iff i x = i y ,j x = j y , k x = k y and A x ■< A y . 

Definition 4.14 let A(S) = {z \ z = DM(x,y) for some x,y £ S and there does not exist 
z' G S such that z' ■< z}. The transition relation 1~ ' holds between two states S and S' ( denoted 
by 5 h 5"; ifS' = SU A(5). 

Definition 4.15 (Computation) A computation is an infinite sequence of states Si,i > 0, 
such that So = S and for every i > 0, Si h Si+i. A computation is terminating if there exists 
some m > for which S m = S m +i (i.e., a fix-point is reached). A computation is successful 
if one of its states contains an item of the form [0, (A, k — l),n] where n is the input length, 
k = len(A) and Abs(A s ) C A k ; otherwise, the computation fails. 

4.3 Proof of Correctness 

In this section we show that parsing, as defined above, is (partially) correct. First, the algo- 
rithm is sound: computations succeed only for input strings that are sentences of the language. 
Second, it is complete: if a string is a sentence, it is accepted by the algorithm. Then we show 
that the computation terminates for off-line parsable grammars. 

4.3.1 Soundness 

Lemma 4.16 If [i, (A,k), j] 6 S m for some m > then i = j only if (A,k) is not complete 
and k = 0. 

Proof:By induction on m. 

Theorem 4.17 (Completion) If (A,k) is a complete edge, len(A) > 1 and A 1J: B then 
A k+1 A B. 

Proof: Since (A, k) is an edge such that len(A) > 1, there exists an abstract rule R such that 
R C A. Hence (A, k + 1) U R k+1 = A k+1 , (R, k + 1) U A k+1 = A and A k+1 A 1Jc . Since 
A 1Jc A B we obtain A k+1 A B. 

Theorem 4.18 (Parsing invariant (a)) If z = [i z , (A z , k z ) , j z ] £ S m for some m > 0, / = 
len(A z ) and i z < j z , then A\~ k ' A PT(i z + l,j z ) ifl>\, A\ ^ PT(i z + l,j z ) if I = 1. 



• B can be obtained by replacing the j-th element of A' with the body of R' , 5 
'—>■' is the reflexive transitive closure of . 

Intuitively, A derives B through some AFS A 3 in A, if some rule p £ 1Z licenses the derivation. 
A 3 is unified with the head of the rule, and if the unification succeeds, the (possibly modified) 
body of the rule replaces A 3 in A. 

Definition 4.5 (Language) The language of a grammar G is L(G) = {w = w\ ■ ■ ■ w n £ 

Words* | A A B for some A □ Abs(A s ) and B □ PT w (l, n)}. 

Figure 3 shows a sequence of derivations, starting from some feature structure that is more 
specific than the initial symbol and ending in a sequence of structures that can stand for the 
string "John loves fish", based upon the example grammar. 

4.2 Parsing as Operational Semantics 

We view parsing as a computational process endowing TFS formalisms with an operational 
semantics, which can be used to derive control mechanisms for an abstract machine we de- 
sign ([14]). A computation is triggered by some input string of words w = w\ ■ ■ ■ w n of length 
n > 0. For the following discussion we fix a particular input string w of length n. A state of the 
computation is a set of items, and states are related by a transition relation. The presentation 
below corresponds to a pure bottom-up algorithm. 

Definition 4.6 (Dotted rules) A dotted rule (or edge) is a pair (A, k) where A = Abs(a) 
is an AMRS such that p C a for some p £ 1Z and where < k < len(A). An edge (A,k) is 
complete if k = len(A) — 1; an edge is active otherwise. 

A dotted rule consists of an AMRS A that is more specific than (the abstraction of) some 
grammar rule, and a number k that denotes the position of the dot within A. The dot can 
precede any element of A (in the case of lexical rules, it can also succeed the rule). 

Definition 4.7 (Items) An item is a triple [i, (A,k),j] where < i < j < n and (A,k) is a 
dotted rule. An item is complete if the edge in it is complete. Let Items be the collection of all 
items. 

During parsing, the intuitive meaning of an item is that the part of A prior to the dot (which 
is indicated by k) derives a substring of the input, and if it can be shown that the part of 
A succeeding the dot derives a consecutive substring of the input, then the head of A derives 
the concatenation of the two substrings. This invariant is formally defined and proven in the 
section 4.3. i and j indicate the span of the item. 

A computation is determined by a sequence of states, each of which is a collection of items, 
where the first state corresponds to the initialization and each subsequent state contains its 
predecessor and is related to it by the transition relation. 

Definition 4.8 (States) A state S C Items is a finite set of items. 

Definition 4.9 (Initialization) Let S = L\ ex U l vr edict be the initial state, where: 

Ilex = {[i - 1, (Ai, 0), i] | 1 < i < n and Ai = PT w (i, i)} 

Ipredict = {[i, (Abs(p), 0), i] | < i < n and p £ U} 

Ii ex contains the (complete) items that correspond to categories of the input words, whereas 
Ipredict contains an (active) item for each grammar rule and a position in the input string. 



5 The exact details can be found in [15]. 



• K, C = ~A U{((j, TTi), (j, 7T 2 )) | TTi « B 7T 2 } 

TAe unification fails if there exists some pair (i, ir) £ II c" smc/j <Aa< 0c/ (i, 7r) = T. 

Many of the properties of AFSs, proven in the previous section, hold for AMRSs, too. In 
particular, if A is an AMRSs then so is (A, j) UB if it is defined, len((A,j) U B) = len(A) and 
(AJ)UB^ A. 

4 Parsing 

Parsing is the process of determining whether a given string belongs to the language defined 
by a given grammar, and assigning a structure to the permissible strings. We formalize and 
explicate some of the notions of [3, chapter 13]. We give direct definitions for rules, grammars 
and languages, based on our new notion of AMRSs. This presentation is more adequate to 
current TFS-based systems than [7, 12], that use first-order terms. Moreover, it does not 
necessitate special, ad-hoc features and types for encoding trees in TFSs as [11] does. We don't 
assume any explicit context-free back-bone for the grammars, as does [13]. 

We describe a pure bottom-up chart-based algorithm. The formalism we presented is aimed 
at being a platform for specifying grammars in HPSG, which is characterized by employing 
a few very general rules; selecting the rules that are applicable in every step of the process 
requires unification anyhow. Therefore we choose a particular parsing algorithm that does not 
make use of top down predictions but rather assumes that every rule might be applied in every 
step. This assumption is realized by initializing the chart with predictive edges for every rule, 
in every position. 

4.1 Rules and Grammars 

We define rules and grammars over a fixed set Words of words. However, we assume that the 
lexicon associates with every word w a feature structure C(w), its category, 1 so we can ignore 
the terminal words and consider only their categories. The input for the parser, therefore, is a 
sequence 2 of TFSs rather than a string of words. 

Definition 4.1 (Pre-terminals) Let w = w\ . . .w n £ Words* and Ai = C{wi) for 1 < i ' < 
n. PT w (j,k) is defined if I < j < k < n, in which case it is the AMRS Abs((Aj , Aj + i, . . .,Ak}). 
Note that PT w (j,k) ■ PT w (k + l,m) = PT w (j,m). We omit the subscript w when it is clear 
from the context. 

Definition 4.2 (Rules) A rule is a MRS of length n > 1 with a distinguished last element. 
If {A\, . . . ,A n -\,A n ) is a rule then A n is its head 3 and {A\, . . .,A n _i) is its body. 4 We write 
such a rule as (Ai , . . . , A n _\ =>• A n ) . In addition, every category of a lexical item is a rule 
(with an empty body). We assume that such categories don't head any other rule. 

Definition 4.3 (Grammars) A grammar G = (1Z, A s ) is a finite set of rules 1Z and a start 
symbol A s that is a TFS. 

An example grammar, whose purpose is purely illustrative, is depicted in figure 2. For the 
following discussion we fix a particular grammar G = (1Z, A s ). 

Definition 4.4 (Derivation) An AMRS A = {Ind A ,U A ,Q A ,^ A ) derives an AMRS B (de- 
noted A — ► B) if there exists a rule p £ 1Z with len(p) = n and R = Abs(p), such that 

• some element of A unifies with the head of R: there exist AMRSs A' , R' and j £ Ind A 
such that A' = (A, j) U R n and R' = (R, n) U A' 

1 Ambiguous words are associated with more than one category. We ignore such cases in the sequel. 

2 We assume that there is no reentrancy among lexical items. 

3 This use of head must not be confused with the linguistic one, the core features of a phrase. 

4 Notice that the traditional direction is reversed and that the head and the body need not be disjoint. 



. Ind a = 



. H CT = {(i,7r) | 6(g,-,7r)J.} 
. <T (i» = (9(«(g i ,7r)) 

• (i, 7Tl) « CT (j, 7T 2 ) iff ($(</;, 7Tl) = 6(qj,TT 2 ) 



It is easy to see that j4&s(<t) is an AMRS. In particular, notice that for every i £ Ind a there 
exists a path 7r such that (i, tt) 6 n<j since for every i,b(qi,e){. The reverse operation, Cone, 
can be defined in a similar manner. 

AMRSs are used to represent ordered collections of AFSs. However, due to the possibility 
of value sharing among the constituents of AMRSs, they are not sequences in the mathematical 
sense, and the notion of sub-structure has to be defined in order to relate them to AFSs. 

Definition 3.4 (Sub-structures) Let A = (IndA, 11^, 0^, ~^}/ let Ind B be a finite (contigu- 
ous) subsequence of IndA,' let n+l be the index of the first element ofInd B . The sub-structure 
of A induced by Ind B is an AMRS B = (Ind B , II5, Q B , Ki B ) such that: 

• (i — n, 7r) G lis iff i £ Ind B and (i, tt) £ A 

• 0s(* — n, it) = 7r) if i £ Ind B 

• (ii - n, 7Ti) & B (i 2 - n, 7r 2 ) iff ii £ Ind B ,i 2 £ lnd B and , tti) &a (h, ^2) 

A sub-structure of A is obtained by selecting a subsequence of the indices of A and considering 
the structure they induce. Trivially, this structure is an AMRS. We use A J " k to refer to the 
sub-structure of A induced by (j, . . . , k) . If Ind B = {«}, A l A can be identified with an AFS, 
denoted A 1 . 

The notion of concatenation has to be defined for AMRSs, too: 

Definition 3.5 (Concatenation) The concatenation of A = (IndA, 11^, 0^, &a) and B = 

(Ind B , H B , Q B , Ki B ) of lengths riA,n B , respectively (denoted by A ■ B), is an AMRS C = 
(Indc, lie 0c, ~c) such that 

• Ind c = (1, • • • , n A + n B ) 



• k, c = tx A U{((ii + n A ,TTi), (i 2 + n A ,K 2 j) \ (h, fi) ~s (h,^)} 

We now extend the definition of unification to AMRSs: we want to allow the unification 
of two AFSs, one of which is a part of an AMRS. Therefore, one operand is a pair consisting 
of an AMRS and an index, specifying some element of it, and the second operand is an AFS. 
Recall that due to reentrancies, other elements of the AMRS can be affected by this operation. 
Therefore, the result of the unification is a new AMRS. 

Definition 3.6 (Unification in context) Let A = (IndA, ^A, ©A, ~a) be an AMRS, B = 
(n B ,Q B ,~ B ) an AFS. (A, j) U B is defined if j £ IndA, in which case it is the AMRS C" = 



• n c = Il A U {(i + n A , tt) I (i, tt) £ n B } 




Ty(Eq(Cl({Indc,nc,ec,^c)))), where 



• Indc = LndA 



. n c = n A u{(j,7t) I tt £ n B } 




Meta-variables a, p range over MRSs, and S, Q and Q over their constituents. If (Q, G) is a 
MRS and qi is a root in Q then q~i naturally induces a feature structure Pr(Q, i) = (Qi, qi, Si), 
where Qi is the set of nodes reachable from qi and Si = 

One can view a MRS (Q, G) as an ordered sequence (A\ , . . . , A n ) of (not necessarily disjoint) 
feature structures, where Ai = Pr(Q, i) for 1 < i < n. Note that such an ordered list of feature 
structures is not a sequence in the mathematical sense: removing an element from the list 
effects the other elements (due to reentrancy among elements). Nevertheless, we can think of 
a MRS as a sequence where a subsequence is obtained by taking a subsequence of the roots 
and considering only the feature structures they induce. We use the two views interchangeably. 
Figure 1 depicts a MRS and its view as a sequence of feature structures. 

A MRS is well-typed if all its constituent feature structures are well-typed, and is totally 
well-typed if all its constituents are. Subsumption is extended to MRSs as follows: 

Definition 3.2 (Subsumption of multi-rooted structures) A MRS a = (Q,G) subsu- 
mes a MRS a' = (Q',G') (denoted by a El a') if \Q\ = \Q'\ and there exists a total function 
h : Q — ► Q' such that: 

• for every root qi £ Q, h(q~i) = q\ 

• for every q £ Q, 6(q) C 6' (h(q)) 

• for every q £ Q and f £ Feats, if 6(q, f)l then h(6(q, /)) = S'(h(q), f) 

We define abstract multi-rooted structures in an analog way to abstract feature structures. 

Definition 3.3 (Abstract multi-rooted structures) A pre- abstract multi rooted stru- 
cture (pre-AMRS) is a quadruple A = (Ind, II, 0, ps), where: 

• Ind, the indices of A, is the sequence (1, . . . , n) for some n 

• II C Ind x Paths is a non-empty set of indexed paths, such that for each i £ Ind there 
exists some tt £ Paths that (i, ir) £ II. 

• : II — ► Types is a total type-assignment function 

• «C II x II is a relation 

An abstract multi-rooted structure (AMRS) is a pre-AMRS A for which the following 
requirements, naturally extending those of AFSs, hold: 

• II is prefix- closed: if (i, ira) £ II then (i, ir) £ II 

• A is fusion-closed: if (i,Tra) £ II and (i',7r'a') £ II and (i, ir) ps (*',""') then (i, ira') £ II 
(as well as (i',7r'a) EH), and (i, ira') ps (i',7r'a') (as well as (i',7r'a) ps (i, to) j 

• P» is an equivalence relation 

• respects the equivalence: «/(ii,7Ti) ps (12,^2) then 0(ii,7Ti) = ©(12,^2) 

An AMRS (Ind, II, 6, «) is well-typed if for every (i, w) £ II, 6(i, w) ^ T and if (i, wf) £ II then 
Approp(f, 0(i, 7r))|. and Approp(f,Q(i,ir)) C 0(i,7r/). It is totally well typed if, in addition, 
for every (i, tt) £ II, if Approp(f, 0(i, 7r)) J, then (i,irf) £ II. The length of an AMRS A is 
len(A) = \Ind A \. 

The closure operations CI and Eq are naturally extended to AMRSs: If A is a pre-AMRS 
then Cl(A) is the least extension of A that is prefix- and fusion-closed, and Eq(A) is the 
least extension of A to a pre-AMRS in which ps is an equivalence relation. In addition, 
Ty((Ind, II, 0, ps}) = (Ind, II, 0', p») where 0'(i, ir) = LJ^-, ^r^^ w j ©(«', tt'). The partial order 
^ is extended to AMRSs: (Ind A , Ha, 0a, ~a) d: (Inds, Hb, 0b, ~b) iff IndA = Inds, Ha Q 
Hb,~aQ~b and for every (i, w) £ H A , 6a(', t) E ©b(«, "")• 

AMRSs, too, can be related to concrete ones in a natural way: If a = (Q, G) is a MRS then 
Abs(a) = (Ind a , H a , CT , rs ct ) is defined by: 



• K, C —K, A U 



The unification fails if there exists a path tt £ II c" such that Qc'i^) = T. 

Lemma 2.9 CI preserves prefixes: If A is a prefix-closed pre-AFS and A' = Cl(A) then A' is 
prefix-closed. 

Lemma 2.10 Eq preserves prefixes and fusions: If A is a prefix- and fusion-closed pre-AFS 
and A' = Eq(A) then A' is prefix- and fusion-closed. 

Corollary 2.11 If A and B are AFSs, then so is A U B. 

C" is the smallest AFS that contains lie and Fie- Since 11^ and II 5 are prefix-closed, so is 
He- However, lie and k,q might not be fusion-closed. This is why CI is applied to them. As 
a result of its application, new paths and equivalence classes might be added. By lemma 2.9, 
if a path is added all its prefixes are added, too, so the prefix-closure is preserved. Then, Eq 
extends ps to an equivalence relation, without harming the prefix- and fusion-closure properties 
(by lemma 2.10). Finally, Ty sees to it that respects the equivalences. 

Lemma 2.12 Unification is commutative: A U B = B U A. 

Lemma 2.13 Unification is associative: (A U B) U C = A U (B U C). 

The result of a unification can differ from any of its arguments in three ways: paths that 
were not present can be added; the types of nodes can become more specific; and reentrancies 
can be added, that is, the number of equivalence classes of paths can decrease. Consequently, 
the result of a unification is always more specific than any of its arguments. 

Theorem 2.14 If C = AU B then A < C . 

TFSs (and therefore AFSs) can be seen as a generalization of first-order terms (FOTs) 
(see [1]). Accordingly, AFS unification resembles FOT unification; however, the notion of 
substitution that is central to the definition of FOT unification is missing here, and as far as we 
know, no analog to substitutions in the domain of feature structures was ever presented. 

3 Multi-rooted Structures 

To be able to represent complex linguistic information, such as phrase structure, the notion of 
feature structures has to be extended. HPSG does so by introducing special features, such as 
DTRS (daughters), to encode trees in TFSs. This solution requires a declaration of the special 
features, along with their intended meaning; such a declaration is missing in [10]. An alternative 
technique is employed by Shieber ([11]): natural numbers are used as special features, to encode 
the order of daughters in a tree. In a typed system this method necessitates the addition of 
special types as well; theoretically, the number of features and types necessary to state rules is 
unbounded. 

As a more coherent, mathematically elegant solution, we define multi-rooted structures, 
naturally extending TFSs. These structures provide a means to represent phrasal signs and 
grammar rules. They are used implicitly in the computational linguistics literature, but to the 
best of our knowledge no explicit, formal theory of these structures and their properties was 
formulated before. 

Definition 3.1 (Multi-rooted structures) A multi-rooted feature structure (MRS) is a 
pair (Q, G) where G = (Q, 8} is a finite, directed, labeled graph consisting of a set Q C Nodes of 
nodes and a partial function 8 : Q x Feats — ► Q specifying the arcs, and where Q is an ordered, 
non-empty (repetition-free) list of distinguished nodes in Q called roots. G is not necessarily 
connected, but the union of all the nodes reachable from all the roots in Q is required to yield 
exactly Q. The length of a MRS is the number of its roots, \Q\. 



An AFS (II, 0, ?s) is well-typed if Q(tt) ^ T for every 7r 6 II and if irf G II then Approp(f, 0(tt))J. 
and Approp( f, Q(tt)) C 0(7r/). It is totally well typed if, in addition, for every 7r £ II, if 
Approp(f, 0(tt))J. then 7r/ G II. 

Abstract features structures can be related to concrete ones in a natural way: If A = (Q, q, 8) 
is a TFS then j4&s(j4) = (ILa, 0^, ps^} is defined by: 



It is easy to see that Abs(A) is an abstract feature structure. 

For the reverse direction, consider an AFS A = (II,0,ps). First construct a 'pseudo-TFS', 
Conc(A) = (Q,q,8), that differs from a TFS only in that its nodes are not drawn from the 
set Nodes. Let Q = {q[ w ] \ [it] G [~]}- Let 9(q^) = 0(-7r) for every node - since A is an 
AFS, respects the equivalence and therefore 9 is representative-independent. Let q = and 
HQ[tt]j f) = Q[irf] f° r every node and feature /. Since A is fusion-closed, 8 is representative- 
independent. By injecting Q into Nodes making use of the richness on Nodes, a concrete 
TFS Conc(A) is obtained, representing the equivalence class of alphabetic variants that can 
be obtained that way. We abuse the notation Conc(A) in the sequel to refer to this set of 
alphabetic variants. 

Theorem 2.2 If A' G Conc(A) then Abs(A') = A. 

AFSs can be partially ordered: (ILajQa^a) d: (H_b, 0_b, ~_b) iff II ^ C T1b,~aQ~b and 
for every 7r G n^,0^(7r) C 0s(7r). This order corresponds to the subsumption ordering on 
TFSs, as the following theorems show. 

Theorem 2.3 A C B iff Abs(A) < Abs(B). 

Theorem 2.4 For every A G Conc(A'),B G Conc(B'),AQ B iff A' < B' . 
Corollary 2.5 A ~ B iff Abs(A) = Abs(B). 
Corollary 2.6 Conc(A') ~ Conc(B') iff A = B. 

2.2 Unification 

As there exists a one to one correspondence between AFSs and (alphabetic variants of) concrete 
ones, we define unification over AFSs. This leads to a simpler definition that captures the 
essence of the operation better than the traditional definition. We use the term 'unification' to 
refer to both the operation and its result. 

Definition 2.7 (Closure operations) Let CI be a fusion- closure operation on pre-AFSs: 
Cl(A) = A', where A' is the least extension of A to a fusion-closed structure. Let Eq((fl,0,& 
)) = (II, 0, r;')) where is the least extension of ps to an equivalence relation. Let Ty((fl, 0, ps 



)) = (II, 0', «) where 0'(tt) = U^,^ 0(tt). 

Definition 2.8 (Unification) The unification A U B of two AFSs A = (ILajQa^a) an d 
B = (fl B ,e B ,^B) is an AFS C" = Ty(Eq(Cl(C))), where: 

. c = (n c ,e c ,«c) 
• n c = n i un B 



. IU = {7r|%-,7r)j} 

• 7Tl tt A 7T 2 iff 8{q, 7Tl) = 8(q, 7T 2 ) 




• Formalization and explication of the notion of multi-rooted feature structures that are 
used implicitly in the computational linguistics literature; 

• Concise definitions of a TFS-based linguistic formalism, based on abstract MRSs; 

• Specification and correctness proofs for parsing in this framework. 

2 Theory of Feature Structures 

2.1 Types, Features and Feature Structures 

We assume familiarity with the theory of TFS as in [3, chapters 1-6], and only summarize some 
of its preliminary notions. When dealing with partial functions the symbol 1 f{x) J,' means that 
/ is defined for the value x and the symbol 'f means undefinedness. Whenever the result of an 
application of a partial function is used as an operand, it is meant that the function is defined 
for its arguments. 

For the following discussion, fix non-empty, finite, disjoint sets Types and Feats of types 
and feature names, respectively. Let Paths = Feats* denote the collection of paths, where 
Feats is totally ordered. Fix also an infinite set Nodes of nodes and a typing function 9 : 
Nodes —> Types. The set Nodes is 'rich' in the sense that for every t £ Types, the set {q £ 
Nodes | 8{q) = t} is infinite. We use the bounded complete partial order C. over Types x Types 
to denote the type hierarchy, and the partial function Approp : Feats x Types — ► Types 
to denote the appropriate specification. 

A feature structure is a directed, connected, labeled graph consisting of a finite, nonempty 
set of nodes Q C Nodes, a root q £ Q, and a partial function 6 : Q x Feats — ► Q specifying the 
arcs such that every node q £ Q is accessible from q. We overload 'C' to denote also subsumption 
of feature structures. Two feature structures A\ and are alphabetic variants (A\ ~ A2) 
iff Ai C A 2 and A 2 C. A x . 

Alphabetic variants have exactly the same structure, and corresponding nodes have the same 
types. Only the identities of the nodes distinguish them. The essential properties of a feature 
structure, excluding the identities of its nodes, can be captured by three components: the set 
of paths, the type assigned to every path, and the sets of paths that lead to the same node. 
In contrast to other approaches (e.g., [3]), we first define abstract feature structures and then 
show their relation to concrete ones. The representation of graphs as sets of paths is inspired by 
works on the semantics of concurrent programming languages, and the notion of fusion-closure 
is due to [4]. 

Definition 2.1 (Abstract feature structures) A pre- abstract feature structure (pre- 
AFS) is a triple (II,0,ps), where 

• II C Paths is a non-empty set of paths 

• : II — ► Types is a total function, assigning a type to every path 

• psC II x II is a relation specifying reentrancy (with [rs] the set of its equivalence classes) 
An abstract feature structure (AFS) is a pre-AFS for which the following requirements hold: 

• II is prefix- closed: if it a £ II then tt £ II (where 7r, a £ Paths ) 

• A is fusion-closed: if it a £ II and ir'a' £ II and tt pa ir' then ira' £ II, 0(7ra') = 0(7r'a') 
( as well as ir'a £ II, 0(7r'a) = 0(7ra) ), and ira' ?a ir'a' ( as well as ir'a ps 7ra ) 

• p» is an equivalence relation with a finite index 

• respects the equivalence: if it\ ~ 1^2 then Q(tti) = ©(7^) 
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Abstract 

In this paper we provide for parsing with respect to grammars expressed in a general 
TFS-based formalism, a restriction of ALE ([2]). Our motivation being the design of an 
abstract (WAM-like) machine for the formalism ([14]), we consider parsing as a compu- 
tational process and use it as an operational semantics to guide the design of the control 
structures for the abstract machine. 

We emphasize the notion of abstract typed feature structures (AFSs) that encode 
the essential information of TFSs and define unification over AFSs rather than over TFSs. 
We then introduce an explicit construct of multi-rooted feature structures (MRSs) 
that naturally extend TFSs and use them to represent phrasal signs as well as grammar 
rules. We also employ abstractions of MRSs and give the mathematical foundations needed 
for manipulating them. We then present a simple bottom-up chart parser as a model for 
computation: grammars written in the TFS-based formalism are executed by the parser. 
Finally, we show that the parser is correct. 

1 Introduction 

Typed feature structures (TFSs) serve for the specification of linguistic information in current 
linguistic formalisms such as HPSG ([10]) or Categorial Grammar ([8]). They can represent 
lexical items, phrases and rules. Usually, no mechanism for manipulating TFSs (e.g., parsing 
algorithm) is inherent to the formalism. Current approaches to processing HPSG grammars 
either translate them to Prolog (e.g., [2, 5, 6]) or use a general constraint system ([16]). 

In this paper we provide for parsing with grammars expressed in a general TFS-based 
formalism, a restriction of ALE ([2]). Our motivation is the design of an abstract (WAM-like) 
machine for the formalism ([14]); we consider parsing as a computational process and use it as 
an operational semantics to guide the design of the control structures for the abstract machine. 
In this paper the machine is not discussed further. 

Section 2 outlines the theory of TFSs of [1, 3]. We emphasize abstract typed feature 
structures (AFSs) that encode the essential information of TFSs and extend unification to 
AFSs. Section 3 introduces an explicit construct of multi-rooted feature structures (MRSs) 
that naturally extend TFSs, used to represent phrasal signs as well as grammar rules. Abstrac- 
tion is extended to MRSs and the mathematical foundations needed for manipulating them is 
given. In section 4 a simple bottom-up chart parser for the TFS-based formalism is presented 
and shown correct. The appendix contains examples of MRSs and grammars as well as a sim- 
ulation of parsing. Due to space limitations we replace many proofs by informal descriptions 
and examples; the formal details are given in [15]. The main contributions of this paper are: 



